Dicionário-Aberto: A Source of Resources for the Portuguese Language Processing

نویسندگان

  • Alberto Simões
  • Álvaro Iriarte Sanromán
  • José João Almeida
چکیده

In this paper we describe how Dicionário-Aberto, an online dictionary for the Portuguese language, is being used as the base to construct diverse resources that are relevant in the processing of the Portuguese language. We will briefly present its history, explaining how we got here. Then, we will describe the resources already available to download and use, followed by the discussion on the resources that are being currently developed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Processing and extracting data from an open dictionary of the Portuguese language

Synonyms dictionaries are useful resources for natural language processing. Unfortunately their availability in digital format is limited, as publishing companies do not release their dictionaries in open digital formats. Dicionário-Aberto (Simões and Farinha, 2010) is an open and free digital synonyms dictionary for the Portuguese language. It is under public domain and in textual digital form...

متن کامل

The Presence and Influence of English in the Portuguese Financial Media

As the lingua franca of the 21st century, English has become the main language for intercultural communication for those wanting to embrace globalization. In Portugal, it is the second language of most public and private domains influencing its culture and discourses. Language contact situations transform languages by the incorporations they make from other languages and Portugal has...

متن کامل

PAPEL: a lexical ontology for Portuguese

PAPEL (Palavras Associadas Porto Editora Linguateca) is a lexical resource for natural language processing (NLP) of Portuguese which is being built by Linguateca, based on processing a major commercial Portuguese dictionary, the Dicionário da Ĺıngua Portuguesa (DLP) [1] developed and owned by the largest Portuguese dictionary publisher, Porto Editora. There are known similar lexical resources f...

متن کامل

Some Language Resources and Tools for Computational Processing of Portuguese at INESC

In the last few years automatic processing tools and studies based on corpora have became of a great importance for the community. The possibility of evaluating and developing such tools and studies depends on the availability of language resources. For the Portuguese language in its several national varieties these resources are not enough to meet the community needs. In this paper some valuab...

متن کامل

Coreference Resolution for Portuguese: Person, Location and Organization

Coreference resolution is a process that consists in identifying the several forms that a specific entity may be referred to in a discourse. The automatic textual coreference resolution is a very important task in Natural Language Processing (NLP), because several others need its output. The present work shows a system for coreference resolution for Portuguese and its evaluation. The system sol...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012